Towards Cohesive Anomaly Mining
Authors
Abstract
In some applications, such as bioinformatics, social network analysis, and computational criminology, it is desirable to find compact clusters formed by a (very) small portion of the objects in a large data set. Because such clusters comprise only a small number of objects, they are extraordinary and anomalous with respect to the entire data set. Conventional clustering methods do not solve this type of clustering task well, since they generally try to assign most of the data objects to clusters. In this paper, we model this novel, application-inspired task as the problem of mining cohesive anomalies. We propose a general framework and a principled approach to tackle the problem. Experimental results on both synthetic and real data sets verify the effectiveness and efficiency of our approach.
Similar resources
Identification of Ti- anomaly in stream sediment geochemistry using of stepwise factor analysis and multifractal model in Delijan district, Iran
In this study, 115 samples taken from the stream sediments were analyzed for concentrations of As, Co, Cr, Cu, Ni, Pb, W, Zn, Au, Ba, Fe, Mn, Sr, Ti, U, V and Zr. In order to outline mineralization-derived stream sediments, various mapping techniques including fuzzy factor score, geochemical halos and fractal model were used. Based on these models, concentrations of Co, Cr, Ni, Zn, Ba, Fe, Mn, ...
A hybrid-logic approach towards fault detection in complex cyber-physical systems
Existing data mining approaches to complex systems anomaly detection use uni-variate and/or multi-variate statistical hypothesis testing to assign anomaly scores to data streams associated with system components. The former approach assumes statistical independence of individual components, while the latter assumes substantial global systemic correlation. As a compromise between these two epist...
Local multivariate outliers as geochemical anomaly halos indicators, a case study: Hamich area, Southern Khorasan, Iran
Anomaly recognition has always been a prominent subject in preliminary geochemical explorations. Among the regional geochemical data processing, there are a range of statistical and data mining techniques as well as different mapping methods, which serve as presentations of the outputs. The outlier’s values are of interest in the investigations where data are gathered under controlled condition...
Identification of Data Cohesive Subsystems Using Data Mining Techniques
The activity of reengineering and maintaining large legacy systems involves the use of design recovery techniques to produce abstractions that facilitate the understanding of the system. In this paper, we present an approach to design recovery based on data mining. This approach derives from the observation that data mining can discover unsuspected non-trivial relationships among elements in la...
Investigations of the Material Composition of Iron-containing Tails of the Enrichment of the Mining and Processing Combines of the Kursk Magnetic Anomaly of Russia
The inevitable depletion of mineral resources, the constant deterioration of the geological and mining conditions for the development of mineral deposits and the restoration of raw materials from mining waste by recycling are all urgent problems we face today. The solution to this problem may ensure: a considerable extension of raw material source; decrease of investments in opening new deposit...